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Abstract. We say that a collection C of subsets of X is bureaucratic if every maximal hierar- 
chy on X contained in C is also maximum. We characterise bureaucratic set systems and show 
how they arise in phylogenetics. This framework has several useful algorithmic consequences: 
we generalize some earlier results and derive a polynomial-time algorithm for a parsimony 
problem arising in phylogenetic networks. 



1. Bureaucratic sets and their characterization 

We first recall some standard phylogenetic terminology (for more details, the reader can 
consult fSI ) . Recall that a hierarchy H on a finite set X is a collection of sets with the property 
that the intersection of any two sets is either empty or equal to one of the two sets; we also 
assume that X eH. 




Figure 1. (a): A rooted tree T with leaf set X = {1,2,3,4,5}, and 
with cluster set c(T) being equal to the hierarchy H consisting of the sets 
{1, 2, 3}, {4, 5} and the trivial clusters, (b): A binary tree T with a cluster 
set consisting of ?^ U {{1,2}}. (c): A binary and planar phylogenetic net- 
work Af over X — {1,2,3,4} with a soft-wired cluster set sw{N') consisting 
of {1, 2}, {2, 3}, {3, 4}, {1, 2, 3}, {2, 3, 4} and the trivial clusters. 



A hierarchy is maximum if \T-L\ = 2|A| — 1, which is the largest possible cardinality. In this case 
H corresponds to the set of clusters c(T) of some rooted binary tree T with leaf set X (a cluster 
of T is the set of leaves that are separated from the root of the tree by any vertex). A maximum 
hierarchy necessarily contains {x} for each x £ X, a.s well as X itself; we will refer to these 
|A| + 1 sets as the trivial clusters of X. More generally, any hierarchy containing all the trivial 
clusters corresponds to the clusters c{T) of a rooted tree T with leaf set X (examples of these 
concepts are illustrated in Fig. [TJa),(b)). Note that a hierarchy H is maximum if and only if (i) 
H contains all the trivial clusters, and (ii) each set C G "H of size greater than 1 can be written 
as a disjoint union C — A\J B, for two (disjoint) sets A,B E H. 
We now introduce a new notion. 
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Definition: We say that a collection C of subsets of a finite set X is a bureaucracy if (i) C 7^ 
and ^ C, and (ii) every hierarchy T-l Q C can be extended to a maximum hierarchy Ti' such 
that TL CJi' (I C. In this case, we say that the collection is bureaucratic. 

Simple examples of bureaucracies include two extreme cases: the set of clusters of a binary 
tree, and the set ViX) of aU non-empty subsets of X. Notice that {{a}, {&}, {c}, {a, 6}, {a, b, c}} 
and {{a},{b},{c},{b,c},{a,b,c}} are both bureaucratic subsets of X = {a, &, c} but their in- 
tersection, {{a}, {6}, {c}, {a, &, c}}, is not. In particular, for an arbitrary subset Y of X (e.g. 
Y = {{a}, {&}, {c}, {a, 6, c}}), there may not be a unique minimal bureaucratic subset of X 
containing Y. 

In the next section we describe a more extensive list of examples, but first we describe some 
properties and provide a characterization of bureaucracies. In the following lemma, given two 
sets A and B from C we say that B covers Aii A C B and there is no set C G C with A C C C B. 

Lemma 1. If C is bureaucratic then: 

(i) For any pair A, B C, if B covers A then B — A E C. 

(ii) For any C £ C with \C\ > 1, we can write C ^ AU B for (disjoint) sets A, B G C. 

Proof. For Part (i), suppose that A,B € C and that B covers A. Let H — {A,B}. Then H is 
a hierarchy that is contained within C and so there exists a maximum hierarchy Ji' C C that 
contains Note that A must be a maximal sub-cluster of B in "H' (as otherwise B does not 
cover A) which requires that B — A is a. cluster of H' and thereby an element of C. 

For Part (ii), observe that the set H = {C} is a hierarchy, and the assumption that C is 
bureaucratic ensures the existence of a maximum hierarchy "H' C C containing H, and so H' 
contains the required sets A, B. □ 

Note that the conditions described in Parts (i) and (ii) of Lemma [U while they are necessary 
for C to be a bureaucracy, are not sufficient. For example, let X = {1, 2, 3, 4, 5, 6} and let C be 
the union of 

{{1,2}, {3, 4}, {5, 6}, {1,2,3}, {4, 5, 6}, {3, 4, 5}, {1,2,6}, {1,5, 6}, {2,3, 4}} 

with the set of the seven trivial clusters. Then C satisfies Parts (i) and (ii) of Lemma [TJ yet C is 
not bureaucratic since Ti — {{1, 2}, {3, 4}, {5, 6}} does not extend to a maximum hierarchy on 
X using just elements from C. 

Theorem 2. A collection C of subsets of X is bureaucratic if and only if it satisfies the following 
two properties: 

• (PI) C contains all trivial clusters of X. 

• (P2)//{Ci, C2, . . . , Ck} Q C are disjoint and have union UiC'i in C then there are distinct 
i,j such that Ci U Cj G C. 

Proof. First suppose that C is bureaucratic. Then C contains a maximal hierarchy; in particular, 
it contains all the trivial clusters, and so (PI) holds. For (P2), suppose that C is a collection 
of /c > 3 disjoint subsets of X, each an element of C, and IJC G C. Then H = C U {IJC} is 
a hierarchy. Let C C be a maximal hierarchy on X that contains T-L (this exists, since C is 
bureaucratic) and let C be a minimal subset of X in %' that contains the union of at least two 
elements of C . Since %' is a binary hierarchy, and IJC S H', C is precisely the union of exactly 
two elements of C; since C G "H' C C, this establishes (P2). 

Conversely, suppose that a collection C of subsets of X satisfies (PI) and (P2), and that 
7^ C C is a maximal hierarchy which is contained within C. Suppose that H. is not maximum 
(we will derive a contradiction). Then % contains a set C that is the disjoint union of fc > 3 
maximal proper subsets Ai, . . . ^ Ak, each belonging to Ti, (and thereby C). Applying (P2) to 
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C — {Ai, . . . ,Ak], there exists two sets, say Ai,Aj for which Ai U Aj e C. So, if we let H' = 
T-L U {Ai U Aj}, then we obtain a larger hierarchy containing T-L that is still contained within C, 
which is a contradiction. This completes the proof. □ 

2. Examples of bureaucracies 

We have mentioned two extreme cases of bureaucracies, namely the set of clusters of a binary 
X— tree and the full power set V{X). Here are some further examples. 

(1) The set of intervals of [n] — {l,2,...,n} is a bureaucracy, where an interval is a set 

= {k : i < k < j}, I < i < j < n. 

Proof. Let C be the set of intervals of [n]. Then C contains the trivial clusters. Also, a 
disjoint collection Ii, . . . ,Ik, fc > 2 of intervals has union an interval if and only if every 
element of [n] between minlj Ij and maxlj Ij lies in (exactly) one interval, in which case 
the union of any pair of consecutive intervals is an interval, so (P2) holds. By Theorem 
m C is bureaucratic. □ 

Similarly, if we order the elements of X in any fashion, we can define the set of intervals 
on X for that ordering by this construction (associating Xi with i), and can thus obtain 
a bureaucracy. 

A natural question at this point is the following: Does the extension of intervals in a 
1-dimensional lattice (Example 1) to rectangles in a 2-dimensional lattice also necessarily 
lead to bureaucracies? The answer is 'no' because condition (P2) can be violated due 
to the existence of subdivisions of integral sized rectangles into fc > 2 disjoint squares of 
different integral sizes, the union of any two of which must therefore fail to be a rectangle 
(see e.g. [2]). 

(2) Let T be a rooted tree (generally not binary) with leaf set X and let C be the set of all 
clusters compatible with all the clusters in c(T). Then C is bureaucratic. 

Proof. We have C = {C C X : C nC e {C,C' , 0} for aU C & c{T)}. C is also the set of 
clusters that occur in at least one rooted phylogenetic X— tree that refines T, that is: 

c= U c(r'). 

T' ■.c(T)Cc{T') 

Suppose that 7^ C C is a hierarchy on X. Then T-L U c(T) is also a hierarchy on X since 
every element of Ti is compatible with every element of c(T). Let H' be any maximal 
hierarchy on X containing H. Then since c{T) C H', we have H' C C, and so, by 
definition, C is a bureaucracy. □ 

(3) Let C be a collection of subsets of X that includes the trivial clusters and which satisfies 
the condition: 

(1) A,B eC and An B AUB eC. 

Then C is bureaucratic if and only if C satisfies the covering condition in Lemma [IJi) . 

Condition ([l} is a weakening of the condition required for a 'patchwork' set system on 
X due to Andreas Dress and Sebastian Bocker (see e.g. [6^, where the covering condition 
of Lemma [T](i) leads to an 'ample patchwork'). 

Proof. The 'only if part follows from Lemma [Hi). Conversely, suppose that (IT|) holds 
for a set system C that includes all the trivial clusters of X and that satisfies the covering 
condition of Lemma [iji). Suppose that H C C is a maximal hierarchy contained within 
C. We show that H is maximum. Suppose that this is not the case - we will derive a 
contradiction (by constructing a larger hierarchy H' containing H but still lying within 
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C). The assumption that H is not maximum imphes that there exists a set B d H which 
is the union of three or more disjoint sets Ai, A2, A^, . . . , Ak, where Ai ^ H (since the 
rooted tree associated with H has a vertex of degree k > 3). We consider two cases: 

(i) B covers none of the sets from Ai, A2, A3, . . . , Ak- 

(ii) B covers one of the sets from Ai, A2, A3, . . . , Ak- 

We first show that Case (i) cannot arise under Condition ([T]). Suppose to the contrary 
Case (i) arises. Then for i — 1, . . .k there exists a set Ci eC that contains Ai and which 
is covered by B. For any pair i,j with i ^ j , ii [B — Ci) r\ Cj = then Cj C d. On the 
other hand, if (S - C^) n Cj ^ then, by Condition (P), {B-Ci)UCj G C, which means 
that B = {B — Ci) U Cj (otherwise {B — Ci) U Cj an element of C strictly containing 
Cj and strictly contained by B) and so Ci C Cj. Thus Case (i) requires that either 
Ci C Cj or Cj C d, which implies (again by the assumption that B covers C, and B 
covers Cj) that Ci = Cj. Since this identity holds for all distinct pairs i,j it follows that 
Ci, C2, ■ • ■ , Cfc are the same set C and this set contains lJi=i (since Ai C Ci). But 
then B = lJi=i ^ C which contradicts the assumption that B covers Ci{— C). 

Thus only Case (ii) can arise. In this case, suppose that B covers Ai. By assumption 
that C satisfies the covering condition described in Lemma [^i), B — Ai e C holds, and 
so we can take H' = HU {B — Ai} which provides the required contradiction. 

□ 

(4) Let G = {X, E) be a connected graph. Let C be the set of subsets Y X such that G\V] 
is connected (where C\Y] is the subgraph formed by deleting vertices not in Y , together 
with their incident edges). Then C is bureaucratic. 

Observe that taking G to be a linear graph recovers Example (1). 

Proof. First note that C satisfies (PI), since G itself is connected, as is each vertex by 
itself. Now suppose that Ai, . . . ,Ak, k > 2, are disjoint clusters in C whose union, A, 
is also in C. As G[A] is connected, at least two clusters Ai, Aj must contain adjacent 
vertices, in which case G[Ai U Aj] is connected and Ai U Aj E C. The result now follows 
by Theorem [51 

An alternative proof is to apply Example (3) and note that C satisfies Condition ([1]) 
and the covering condition of Lemma [T](i). □ 

3. Algorithmic applications 

3.1. MELximum weight hierarchies. In general, the problem of finding the largest hierarchy 
contained within a set of clusters is NP-hard 3] . The problem becomes trivial in a bureaucratic 
collection since all maximal hierarchies are maximum. Less obvious, however, is the fact that 
finding a hierarchy with maximum weight can also be solved in polynomial time. 

Theorem 3. Let C be a bureaucratic collection of clusters on X and let w : C — > R be a weight 
function on C. The problem of finding the hierarchy H C C such that w{'H) — '^j^^'^w{A) is 
maximized can be solved in polynomial time. 

Proof. If there are any clusters A £ C with negative weight w{A) then set their weights to zero. It 
follows then that the weight of any maximum hierarchy % Q C equals the weight of the maximum 
weight hierarchy contained within %. The 'Hunting for Trees' algorithm of [T] can now be used 
to recover the maximum hierarchy of maximum weight. □ 

3.2. Parsimony problems on networks. Consider a set C of clusters on X and \ei f : X ^ A 

be a function that assigns each element x E X a. state f{x) in a finite set A (/ is referred to in 
phylogenetics as a (discrete) character). Suppose we have a non- negative function 5 on Ay. A 
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where S{a, b) assigns a penalty score for changing state a to 6 for each pair a,b Cz A (the default 
option is to to take d{a, 6) = 1 for all a b and 6{a, a) = for all a). 

Given any rooted X— tree T, with vertex set V and arc set E, let /(/, T, S) denote the parsimony 
score of / on T relative to S; that is, 

{iu,v)eE 

In words, l{f,T,d) is the minimum sum of <5-penalty scores that are required in order to extend 
/ to an assignment of states to all the vertices of T. This quantity can be calculated for a given 
T by well-known dynamic programming techniques (see e.g. [5]). Let l{f,C,S) (respectively, 
^bm(/, C)) denote the minimal value of /(/, T, S) among all trees T (respectively, all binary trees) 
that have their clusters in C. Then we have the following general result. 

Theorem 4. Suppose that C is contained within a bureaucratic collection C of subsets of X 
and f : X ^ A. There is an algorithm for computing l{f,C,d) with running time polynomial in 
n = \X\, \A\ and \C'\. Moreover, the algorithm can be extended to construct a rooted phylogenetic 
X—tree having all its clusters in C and with parsimony score equal to l{f,C,d) in polynomial 
time. 



Proof. For any subset Y of X, let 



\S{a,b), ifFeC; 

(5y(a,6) = <0, ify^Canda = 6; 

[oo, otherwise; 
and for any rooted phylogenetic X—tree T, let 

^'^^^^'^^'■= F-v^'^'^lx-fl ^ S,iv){F{u),F{v)) 

where c{v) is the cluster of T associated with v. 

Let l'{f,C',6) (respectively, l[^^^{f,C' ,6)) be the minimal value of l'{f,T,S) over all trees 
(respectively, all binary trees) with clusters in C . By the definition of Sy, we have: 



(2) lif,C,S)^l'{f,C',6), 
and by the assumption that C is bureaucratic we have: 

(3) l'{f,C',5)^l{,,M,C\S). 

We now describe how l'^.^^^{f,C',S) can be efficiently calculated by dynamic programming. 

For an element a ^ A and Y e C, let L'{Y, a) be the minimum value of l'{f\Y, T, 5) across all 
binary trees T having leaf set Y and clusters in C, in which the root is assigned state a. 

For |F| = 1, say Y = {y}, we have 



L'{Y,a) 



if /(y) = a; 

otherwise 



and for |y| > 1, we have 



(4) L'(y,a)= min {L'{Yi,ai) + SY^{a,ai) + L'{Y2,a2) + SY,{a,a2) :YiUY2 =Y} . 

Yi,Y2eC' ,ai,a2eA 
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Now, 

lU{f,C'S)^TmnL'{X,a). 

aeA 

Notice that when one evaluates L'{X,a) usmg the above recursion (Eqn. (|1])), it is sufficient to 
compute L'{Y, a) for just the sets Y G C rather than ah subsets of X. 

Thus, in view of Eqns. ^ and ([3]), one can compute l{f,C,d) in time polynomial in n = 
\X\, \ A\ and \C'\. Moreover, by suitable book-keeping along the way, one can construct a rooted 
binary phylogenetic X— tree with clusters in C and with a parsimony score equal to Zbin(/, C , 6); 
by collapsing all edges of this tree that have a 5-score equal to we obtain a rooted phylogenetic 
X— tree with clusters in C and with parsimony score equal to l{f, C,S). □ 

We note that this result has been described in the particular case when C is the bureaucracy 
described in example (2) above, and where / maps to a set A with only two elements [5]. We 
provide a second application, to phylogenetic networks, based on Example (1) above, of intervals 
as bureaucratic set systems. 

Let TV be a rooted binary phylogenetic network on X. We say that Af is planar if it can be 
drawn in the plane so that all the leaves and the root all lie on the outer face. Let sw{Af) denote 
the set of 'soft-wired' clusters in TV (the union of the cluster sets of all trees embedded in TV; see 
e.g. [4]). A simple example is shown in Fig. [Ijc). 

Corollary 5. Suppose thatAf is a binary and planar phylogenetic network on X , and f : X ^ A. 
There is an algorithm for computing l{f, sw{J\f)) with running time polynomial time in n. 

Proof. Let xi , . . . , x„ be the ordering of X given by their positions around the outer face in a 
planar embedding of TV, where xi and Xn come immediately after and before the root. Then any 
tree T embedded in TV can be ordered so that the leaves are in order xi, . . . , Xn, implying that 
the clusters of T are all of the form {xi, Xi+i, . . . ,Xj} for some 1 < i < j < n. It follows that 
the set sw{J\f) is contained in the set of intervals oi X = {xi, . . . ,Xn} (Example 1, above). The 
corollary now follows from Theorem |4l □ 

We end this paper by posing a computational problem. 

Question. Is there an algorithm for deciding whether or not C is bureaucratic that runs in time 
polynomial in |C| and \X\? 
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